Speaker adaptation of continuous density HMMs using multivariate linear regression

نویسندگان

  • Chris Leggetter
  • Philip C. Woodland
چکیده

1 2 1 1 n j j j n H À @ A AE @ A j j j j j j j j j j j j j dependent @ƒsA models —nd —d—pts the model p—r—mE eters to the new spe—ker ˜y tr—nsforming the me—n p—r—meters of the models with — set of line—r tr—nsE formsF „he tr—nsform—tions —re found using — m—xE imum likelihood ™riteri— whi™h is implemented in — simil—r f—shion to the st—nd—rd wv tr—ining —lgoE rithms for rwwsF fy using the s—me tr—nsform—E tion —™ross — num˜er of distri˜utions —nd pooling the tr—nsform—tion tr—ining d—t— m—ximum use is m—de of the —d—pt—tion d—t—F „his —llows the p—r—meters of —ll st—te distri˜utions to ˜e —d—ptedF ‚esults —re presented on the IHHH word e‚€e ‚esour™e w—nE —gement ‚wI d—t—˜—se using — ™ontinuous density q—ussi—n mixture rww system with ™rossEword triE phone modelsF i—™h st—te in — ™ontinuous density q—ussi—n mixture rww h—s —n output distri˜ution m—de up of — numE ˜er of mixture ™omponent densitiesF e st—te with mixture ™omponents ™—n ˜e exp—nded to p—r—lE lel single mixture ™omponent st—tesF „hus the ™—se of single mixture ™omponent st—tes is des™ri˜edD —nd the extension to multiple mixture ™omponents is str—ightE forw—rdF „he pro˜—˜ility density of st—te gener—ting — spee™h o˜serv—tion ve™tor of dimension is @ A a I @P A AE @IA where —nd AE —re the me—n —nd ™ov—ri—n™e reE spe™tively of the output distri˜ution of st—te F „he —d—pt—tion pro™edure is ˜—sed on reEestim—ting the me—ns of the st—te distri˜utions using — line—r tr—nsE form of the existing me—nF „hus it is —ssumed th—t in —d—pting from the ƒs system to the spe—ker —d—pted @ƒeA system the st—te tr—nsition pro˜—˜ilities —nd the ™ov—ri—n™es of the st—te distri˜utions do not ™h—ngeF „he ƒs me—ns —re m—pped to the unknown ƒh me—ns @ A ˜y — line—r regression tr—nsform estim—ted from the —d—pt—tion d—t—X a where is the @ C IA tr—nsform—tion m—trix —nd is the extended me—n ve™torX a ‘I “ sf —n individu—l regression m—trix is used for e—™h st—te using sm—ll —mounts of —d—pt—tion d—t— will reE sult in very poor estim—tes of the m—tri™esF „husD e—™h regression m—trix is —sso™i—ted with m—ny st—te distri˜utions —nd estim—ted from the ™om˜ined d—t—F „ying tr—nsform m—tri™es in this m—nner is simil—r in essen™e to the tying of st—tes or mixtures ‘U“ ‘V“ whi™h m m j n …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Adaptation for Continuous Density HMMs: A Review

This paper reviews some popular speaker adaptation schemes that can be applied to continuous density hidden Markov models. These fall into three families based on MAP adaptation; linear transforms of model parameters such as maximum likelihood linear regression; and speaker clustering/speaker space methods such as eigenvoices. The strengths and weaknesses of each adaptation family are discussed...

متن کامل

Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models

A method of speaker adaptation for continuous density hidden Markov models (HMMs) is presented. An initial speaker-independent system is adapted to improve the modelling of a new speaker by updating the HMM parameters. Statistics are gathered from the available adaptation data and used to calculate a linear regressionbased transformation for the mean vectors. The transformation matrices are cal...

متن کامل

Speaker normalization and adaptation based on linear transformation

We propose novel speaker independent (SI) modeling and speaker adaptation based on a linear transformation. An SI model and speaker dependent (SD) models are usually generated using the same preprocessing of acoustic data. This straightforward preprocessing causes a serious problem. Probability distributions of the SI models become broad and the SI models do not give good initial estimates for ...

متن کامل

Discriminative speaker adaptation with conditional maximum likelihood linear regression

We present a simplified derivation of the extended Baum-Welch procedure, which shows that it can be used for Maximum Mutual Information (MMI) of a large class of continuous emission density hidden Markov models (HMMs). We use the extended Baum-Welch procedure for discriminative estimation of MLLR-type speaker adaptation transformations. The resulting adaptation procedure, termed Conditional Max...

متن کامل

Robust speaker adaptation of continuous density HMMS using multilayer perceptron network

The performance of global affine and nonlinear transformations for speaker adaptation in a hidden Markov model (HMM) speech recognition system are compared in this paper. The nonlinear transformation was obtained with a multilayer perceptron network (MLP) which was trained during the adaptation process to transform the mean vectors of the HMMs such that the output probabilities of the HMMs for ...

متن کامل

Speaker adaptation using tree structured shared-state HMMs

This paper proposes a novel speaker adaptation method that exibly controls state-sharing of HMMs according to the amount of adaptation data. In our scheme, acoustic modeling is combined with adaptation to e ciently utilize the acoustic models sharing characteristics for adaptation. The shared-state set of HMMs is determined by using tree-structured shared-state HMMs created from the history rec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994